Human activity analysis and classification using RGB-D videos
نویسندگان
چکیده
This thesis encompasses parts of the field of computer vision. The main problem dealt with throughout this project is how to automate classification of specific human activities via video streams. The essence of this project is therefore that the developed algorithm shall be able to distinguish these activities from one another. A number of restrictions were imposed on the data-set in order to keep the problem size manageable. A video snippet was assumed to contain a single human that either falls to the ground or, in a controlled way, sits and then lies down on the ground. Our approach relies heavily on background subtraction as the method for detecting foreground objects and comparing the performance of several such approaches for video pre-processing. Features are designed with a heuristic approach utilizing velocities, limb distance relationships and acknowledged techniques based on oriented gradients and optical flow. The classification model is a pre-implemented support vector machine from the libSVM package that is tuned to fit the generated features. The accuracy of our algorithm reached 93.73% which was well above the goal of 85%. On average the classification process roughly took 0.6 second per frame which concludes that the run-time of our algorithm was not fast enough to process a video stream in real time. The result of this work confirms, however, that the methods used works well in a restricted setting and produces a high classification rate. The gist is thus that the proposed method is a good classifier when used in a restricted, offline environment.
منابع مشابه
Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کاملLearning human activities and object affordances from RGB-D videos
Understanding human activities and object affordances are two very important skills, especially for personal robots which operate in human environments. In this work, we consider the problem of extracting a descriptive labeling of the sequence of sub-activities being performed by a human, and more importantly, of their interactions with the objects in the form of associated affordances. Given a...
متن کاملUniform Local Binary Pattern Based Texture-Edge Feature for 3D Human Behavior Recognition
With the rapid development of 3D somatosensory technology, human behavior recognition has become an important research field. Human behavior feature analysis has evolved from traditional 2D features to 3D features. In order to improve the performance of human activity recognition, a human behavior recognition method is proposed, which is based on a hybrid texture-edge local pattern coding featu...
متن کاملComparison of Machine Learning Algorithms for Broad Leaf Species Classification Using UAV-RGB Images
Abstract: Knowing the tree species combination of forests provides valuable information for studying the forest’s economic value, fire risk assessment, biodiversity monitoring, and wildlife habitat improvement. Fieldwork is often time-consuming and labor-required, free satellite data are available in coarse resolution and the use of manned aircraft is relatively costly. Recently, unmanned aeria...
متن کاملLearning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition
We propose Human Pose Models that represent RGB and depth images of human poses independent of clothing textures, backgrounds, lighting conditions, body shapes and camera viewpoints. Learning such universal models requires training images where all factors are varied for every human pose. Capturing such data is prohibitively expensive. Therefore, we develop a framework for synthesizing the trai...
متن کاملSequential Max-Margin Event Detectors
Many applications in computer vision (e.g., games, human computer interaction) require a reliable and early detector of visual events. Existing event detection methods rely on one-versus-all or multi-class classifiers that do not scale well to online detection of large number of events. This paper proposes Sequential Max-Margin Event Detectors (SMMED) to efficiently detect an event in the prese...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015